Claims 



[c1] 1. A method for directing a request involving an expected load to one server out

of a plurality of servers, comprising the steps of: 
selecting a server; 

determining whether said selected server has remaining capacity to handle said 
expected load; and 

directing the request to said selected server, only if said server has remaining 
capacity to handle said expected load. 
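
For illustration only, the three steps of claim 1 can be sketched in Python as follows. The function name `direct_request`, the dictionary of remaining capacities, and the use of uniform random selection are assumptions made for this sketch; the claim itself does not prescribe how the server is selected.

```python
import random

def direct_request(remaining, expected_load, rng=random):
    """Select one server and direct the request only if it has capacity.

    `remaining` maps a hypothetical server name to its remaining capacity.
    Returns the selected server name, or None if the request is not directed.
    """
    server = rng.choice(sorted(remaining))    # selecting a server (assumed uniform)
    if remaining[server] >= expected_load:    # determining remaining capacity
        return server                         # directing the request
    return None                               # not directed: capacity insufficient
```

With a single server of remaining capacity 5, a request with expected load 3 is directed to it, while a request with expected load 6 is not directed.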

[c2] 2. The method of claim 1, further comprising the step of:

providing at least one token associated with each of the plurality of servers; and 
wherein said step of selecting a server includes the step of selecting at least one 
token associated with said server.

3. The method of claim 2, wherein a probability of selecting a token associated 
with said server differs from a probability of selecting a token associated with at 
least one other of the plurality of servers. 



[c3] 4. The method of claim 2, wherein said step of providing at least one token
includes the step of:
providing a number of tokens associated with each of the plurality of servers,
wherein said number is proportional to a load limitation of each of said
plurality of servers.

[c4] 5. The method of claim 4, further comprising the step of skewing a
probability of selection of at least one token associated with said server, said
skewed probability being disproportionate to said number of tokens
associated with said server.
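
As a non-authoritative sketch of the token scheme in claims 2 through 5, the Python below builds a token list in which each server appears in proportion to its load limitation, then draws a token either uniformly or with a skew. The names `build_tokens`, `pick_token`, `tokens_per_unit`, and `skew` are hypothetical, and integer load limitations are assumed.

```python
import random

def build_tokens(load_limits, tokens_per_unit=1):
    """Return a list holding each server name once per token.

    The token count per server is proportional to its (integer) load limit.
    """
    tokens = []
    for server, limit in load_limits.items():
        tokens.extend([server] * (limit * tokens_per_unit))
    return tokens

def pick_token(tokens, skew=None, rng=random):
    """Draw one token and return the server it is associated with.

    `skew` optionally maps a server name to a per-token weight multiplier,
    so the chance of picking a server need not match its share of tokens.
    """
    if skew is None:
        return rng.choice(tokens)             # uniform over tokens
    weights = [skew.get(server, 1.0) for server in tokens]
    return rng.choices(tokens, weights=weights, k=1)[0]
```

Without a skew, a server holding three of four tokens is selected with probability 3/4. Passing `skew={"a": 0.0}` removes server `a` from selection even though it still holds tokens.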

[c5] 6. The method of claim 2, wherein said step of providing at least one token 

includes the step of: 

providing a number of tokens associated with each of the plurality of servers, 
wherein said number is disproportionate to a load limitation of each of said 
plurality of servers and said number is at least partly based on a priority of each 
of the plurality of servers. 
[c6] 7. The method of claim 1, further comprising the step of:

changing said remaining capacity to reflect said expected load if said request is 
directed to said server. 

[c7] 8. The method of claim 1, further comprising the step of:

selecting another server if said server does not have remaining capacity to 
handle said expected load. 

[c8] 9. The method of claim 8, wherein said other server is part of the same set.

[c9] 10. The method of claim 8, wherein said other server is part of a reserve set.

[c10] 11. The method of claim 1, further comprising the step of:

resetting said remaining capacity for each time frame. 
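
The bookkeeping steps of claims 7, 8, and 11 can be illustrated together with a minimal Python sketch. The class `CapacityTracker` and its methods are hypothetical names, and trying the servers in a random order is an assumption; the claims do not state how the other server is chosen.

```python
import random

class CapacityTracker:
    """Tracks remaining capacity per server within one time frame."""

    def __init__(self, limits):
        self.limits = dict(limits)        # per-frame load limit of each server
        self.remaining = dict(limits)     # remaining capacity in current frame

    def reset(self):
        """Restore every server's remaining capacity for a new time frame."""
        self.remaining = dict(self.limits)

    def direct(self, expected_load, rng=random):
        """Try servers in random order; charge the load to the first that fits."""
        candidates = list(self.remaining)
        rng.shuffle(candidates)
        for server in candidates:         # selecting another server on failure
            if self.remaining[server] >= expected_load:
                self.remaining[server] -= expected_load   # reflect the load
                return server
        return None                       # no server has enough capacity left
```

Calling `reset()` once per time frame, for example from a timer, restores the full limits so that requests refused late in one frame can be accepted in the next.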

[c11] 12. A system for allocating requests among servers, comprising:
a plurality of servers;
a first memory divided into entries, with at least one entry associated with each
server and including an indication of said server;
a second memory divided into entries, with at least one entry associated with
each server and including a representation of a remaining capacity of said
server; and
a selector for selecting from among said entries of said first memory.

[c12] 13. The system of claim 12, further comprising:

at least one other set of at least one server to which requests can be allocated if 
there is no remaining capacity in any of said plurality of servers. 
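
One possible reading of the system of claims 12 and 13 is sketched below in Python, with a list standing in for the first memory, a dictionary for the second memory, and a function acting as the selector. The name `allocate` and its parameters are hypothetical. The sketch also assumes that the reserve set is used whenever no primary server can absorb the expected load, and that reserve servers accept the request unconditionally.

```python
import random

def allocate(token_table, capacity, reserve, expected_load, rng=random):
    """Allocate a request to a primary server, else to a reserve server.

    token_table: list of server names, one entry per token (first memory).
    capacity:    dict of server name -> remaining capacity (second memory).
    reserve:     list of fallback server names (the other set of servers).
    """
    # Selector: visit the entries of the first memory in random order.
    for server in rng.sample(token_table, len(token_table)):
        if capacity[server] >= expected_load:
            capacity[server] -= expected_load
            return server
    # No primary server can handle the load: fall back to the reserve set.
    return rng.choice(reserve) if reserve else None
```

Because a server may own several entries in `token_table`, a server with more entries tends to be visited earlier in the random order.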



[c13] 14. A program storage device readable by machine, tangibly embodying a
program of instructions executable by the machine to perform method steps for 
directing a request involving an expected load to one server out of a plurality of 
servers, comprising the steps of: 
selecting a server; 

determining whether said selected server has remaining capacity to handle said 
expected load; and 

directing the request to said selected server, only if said server has remaining 
capacity to handle said expected load.

[c14] 15. A computer program product comprising a computer useable medium

having computer readable program code embodied therein for directing a 
request involving an expected load to one server out of a plurality of servers, 
the computer program product comprising: 

computer readable program code for causing the computer to select a server; 
computer readable program code for causing the computer to determine 
whether said selected server has remaining capacity to handle said expected 
load; and 

computer readable program code for causing the computer to direct the request 
to said selected server, only if said server has remaining capacity to handle said 
expected load. 